Oncolmmunology 3, e28836; May 2014; © 2014 Landes Bioscience 



POINT-OF-VIEW 



High sensitivity of cancer exome-based CD8 T cell neo-antigen 
identification 

Mark M van Buuren, Jorg JA Calis, and Ton NM Schumacher* 

Department of Immunology; The Netherlands Cancer Institute; Amsterdam, The Netherlands 



Keywords: epitope prediction, immune 
monitoring, neo-antigens, tumor vaccine, 
whole exome sequencing 

Abbreviations: BLAST, basic local 
alignment search tool; CTLA4, 
cytotoxic T-lymphocyte antigen 4; 
MUM2, multiple myeloma protein 
2; NSCLC, non-small-cell lung 
cancer; PD1, programmed cell death 
protein 1; PDL1, PCD ligand 1 (PD- 
Ll); PMBEC, peptide:MHC binding 
energy covariance; SCLC, small-cell lung 
cancer; SNV, single nucleotide variant; 
TCR, T-cell receptor; TIL, tumor 
infiltrating lymphocyte 

•Correspondence to: Ton N.M. Schumacher; 
Email: t.schumacher@nki.nl 

Submitted: 04/03/2014 

Accepted: 04/09/2014 

Published Online: 05/14/2014 

Citation: van Buuren MM, Calis JA, Schumacher 
TNM. High sensitivity of cancer exome- 
based CD8Tcell neo-antigen identification. 
Oncolmmunology 2014; 3:e28836; http:// 
dx.doi.org/l0.4161/onci.28836 



Recent data suggest that T-cell reac- 
tivity against tumor-specific neo- 
antigens may be central to the clinical 
efficacy of cancer immunotherapy. 
The development of personalized vac- 
cines designed to boost T-cell reactiv- 
ity against patient specific neo-antigens 
has been proposed largely on the basis of 
these findings. Work from several groups 
has demonstrated that novel tumor-spe- 
cific antigens can be discovered through 
the use of cancer exome sequencing data, 
thereby providing a potential pipeline for 
the development of patient-specific vac- 
cines. Importantly though, it has not been 
established which fraction of cancer neo- 
antigens that can be recognized by CD8* 
T cells is successfully uncovered with the 
current exome-based epitope prediction 
strategies. Here, we use a data set com- 
prising human cancer neo-antigens that 
was previously identified through the use 
of unbiased, computational-independent 
strategies to describe the potential of can- 
cer exome-based neo-antigen discovery. 
This analysis shows a high sensitivity 
of exome-guided neo-antigen predic- 
tion of approximately 70%. We propose 
that future research should focus on the 
analysis and optimization of the specific- 
ity of neo-antigen prediction, and should 
undoubtedly entail the clinical evalu- 
ation of patient-specific vaccines with 
the aim of inducing immunoreactivity 
against tumor-displayed neo-antigens in 
a physiologically relevant context. 



T-cell Immunity Directed 
toward Human Tumors 

The capacity of immune cells to rec- 
ognize, and potentially eradicate, dis- 
tinct types of human tumors has been 
proven. The most recent — and compel- 
ling — evidence comes from a number 
of therapeutic studies using antibodies 
directed against T-cell checkpoint mol- 
ecules. Specifically, administration of 
anti-cytotoxic T-lymphocyte antigen 4 
(anti-CTLA4) to advanced melanoma 
patients has been shown to improve 
overall survival, 1 with long-term clinical 
responses in 10-20% of treated patients. 2 
Furthermore, striking results have been 
obtained in early phase studies assessing 
the effects of blocking antibodies directed 
against the T cell survival regulatory 
programmed cell death 1 (PD1) — pro- 
grammed cell death 1 ligand 1 (PD-L1) 
axis, revealing objective clinical responses 
in around 30% of melanoma patients. 3,4 
Finally, early evidence for synergy 
between PD1 and CTLA4 therapeutic 
targeting has also been obtained. 5 So far 
it has proven difficult to determine pre- 
cisely which immune cell subset, or sub- 
sets, drives the apparent clinical responses 
to T-cell checkpoint blockade therapy. 
However, a study by Dudley and col- 
leagues has shown that cytotoxic T cells 
are responsible for at least some of the 
clinical activity of melanoma immuno- 
therapy by tumor regression in response 
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to adoptive transfer of autologous and ex 
vivo enriched CD8* T cells. 6 

Whereas the clinical activity of anti- 
CTLA4 antibody treatment has, thus far, 
only been convincingly demonstrated for 
the treatment of metastatic melanoma, 
anti-PDl/ anti-PD-Ll treatment has also 
been shown to exhibit clinical activities 
in patients with a variety of tumor types, 
such as ovarian cancer, renal cell cancer 
and non-small-cell lung cancer (NSCLC) 
(ranging from 6-33% of patients). 7 
Interestingly, the observation that clinical 
activity of PD1/ PD-L1 targeting antibod- 
ies is prominent in patients with mela- 
noma and NSCLC, both characterized 
by very high mutation loads, is consistent 
with the hypothesis that T cell recognition 
of tumor-specific neo-antigens — antigens 
that appear on the tumor as a consequence 
of a tumor-specific mutation — may be of 
particular importance. 8 In line with this 
hypothesis, PD1 blockade has shown rela- 
tively higher clinical activity in smoking- 
associated NSCLC, a condition in which 
mutation loads are particularly high, vs. 
non-smoking associated NSCLC, with 
patient response rates of 26% and 10%, 
respectively. 7 

On the basis of these observations, 
as well as the fact that T-cell reactiv- 
ity against neo-antigens should not be 
negatively affected by central tolerance, 9 
and further, should not induce toxicity 
against healthy tissues, the development 
of vaccines that enhance neo-antigen spe- 
cific T-cell reactivity is considered attrac- 
tive. An important consideration in the 
development of such vaccines is that the 
majority of mutations in human tumors 
are 'passengers' that are essentially unique 
to individual patients. As such, molecu- 
larly defined vaccines aiming to induce or 
boost neo-antigen specific T-cell reactivity 
should be developed in a patient-specific 
manner. 

The possibility that tumor-specific 
mutations could result in the presentation 
of peptides recognized as foreign by the 
autologous CD8* T-cell repertoire has for 
long intrigued cancer immunologists. 10 " 12 
Seminal work by Wolfel and colleagues, 
for instance, demonstrated that T-cell rec- 
ognition of mutated epitopes in human 
melanoma could dominate the tumor- 
specific T-cell response. 12 However, the 



experimental approach used in these early 
experiments was substantially too involved 
to describe tumor-specific neo-antigens in 
large cohorts of patients. Furthermore, 
within these experiments, pre-existent 
reactivity of autologous T cells formed the 
experimental basis for neo-antigen identi- 
fication. As such, antigen identification by 
this approach is restricted to those tumor- 
specific neo-antigens for which T-cell 
reactivity is already present, thereby ignor- 
ing the possible repertoire of neo-antigens 
against which T-cell reactivity could be 
induced by vaccination. 

The development of next generation 
sequencing technology has permitted 
the analysis of potential neo-antigens 
expressed in human tumors by a funda- 
mentally different approach, a method 
that could potentially be transformed into 
a high-throughput, standardized process. 
Sahin and colleagues were the first inves- 
tigators to demonstrate how whole-exome 
sequencing data could be used to identify 
neo-antigens that could be recognized by 
CD8 + T cells in the B16F10 murine mela- 
noma models. Subsequent analysis of the 
immunogenicity of the predicted neo- 
antigens by peptide vaccination showed 
that a marked proportion of these peptides 
were able to elicit an immune response, 
and, that a fraction of the induced T-cell 
populations were able to recognize B16F10 
tumors. 13 In parallel work, Schreiber and 
colleagues utilized exome-guided analy- 
sis of a carcinogen-induced mouse tumor 
to identify a neo-antigen recognized 
by endogenous T cells during in vivo 
tumor outgrowth. This work additionally 
showed that such a neo-antigen could be 
lost under immune pressure, 14 providing a 
clear rationale for the targeting of multiple 
neo-antigens in parallel. 8 

Work from our group as well as from 
the National Institute of Health Surgery 
Branch has brought the concept of can- 
cer exome-based neo-antigen discovery 
to the analysis of human malignancies. In 
both studies, cancer exome information 
was used to identify mutations in human 
melanomas, and T-cell reactivity against 
some of the identified potential epitopes 
was evaluated. 1516 In the work of Robbins 
and colleagues, neo-antigen specific T-cell 
responses were observed in four out of five 
patients analyzed, with the side-note that 



this analysis focused on patients with a 
clinical response upon tumor-infiltrating 
lymphocyte (TIL) adoptive-transfer ther- 
apy. 15 In our study, we initially analyzed 
the TILs of a melanoma patient who 
responded to anti-CTLA4 treatment. In 
this patient, exome-guided analysis of 
tumor cells and autologous healthy tissue 
resulted in the identification of two neo- 
antigen-specific T-cell responses within 
the TIL product. Notably, one of these 
neo-antigen-specific T-cell responses 
was also detected in the peripheral blood 
of this patient, and increased markedly 
after the start of anti-CTLA4 treatment, 
thereby providing the first evidence that 
neo-antigens identified on the basis of 
cancer exome sequencing data can be used 
to assess the effects of immunotherapeutic 
interventions. 16 With an additional seven 
melanoma patients analyzed, we have 
now been able to demonstrate neo-anti- 
gen specific T-cell responses in six out of 
eight patients tested (range from 0.002% 
- 65% of CD8* T cells at the tumor site; 
van Buuren et al., unpublished results). 
Furthermore, in recent work, Nelson and 
colleagues used cancer exome data to 
reveal neo-antigen reactivity in one out 
of three patients with ovarian cancer. 17 
Collectively, these results indicate that 
neo-antigen specific T-cell reactivity is a 
common phenomenon in melanoma and 
can also be detected in other tumor types 
with substantially lower mutation loads, 
such as in ovarian cancer. 

Bearing the aforementioned data in 
mind, the development of patient-specific 
vaccines designed to boost or induce T-cell 
responses targeting mutated epitopes 
appears highly attractive. Importantly 
though, while the above data demon- 
strate that T-cell responses against neo- 
antigens can readily be identified on the 
basis of cancer exome sequencing, these 
data reveal neither the specificity nor the 
sensitivity of this approach. In order to 
develop clinically relevant vaccines, par- 
ticularly for tumors with lower mutation 
loads, it will be important that such vac- 
cines do not leave out a large fraction of 
the true T-cell reactive neo-antigens (i.e., 
sensitivity must be sufficient). At the same 
time, it will be equally important that 
these vaccines are not primarily composed 
of epitopes that are incorrectly predicted 
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as possible neo-antigens, (i.e., specificity 
must be sufficient). Here, we address the 
first of these two issues, by defining the 
sensitivity of cancer exome-based neo- 
antigen identification on the basis of an 
independently obtained data set. 

What is the Sensitivity of Cancer 
Exome-Based Neo-Antigen 
Prediction? 

In order to assess the sensitivity of 
cancer exome-based neo-antigen predic- 
tion, we assembled a data set of human 
cancer neo-antigens that had been previ- 
ously discovered by unbiased and non- 
computational strategies. To this end, we 
performed an extensive literature search 
that led us to the identification of 17 pre- 
viously reported neo-antigens that met 
the following criteria: (1) Evidence of 
natural processing and presentation of the 
neo-antigen, as shown by recognition of 
tumor cells by autologous T cells; (2) The 
identified neo-peptides were the result of 
a single nucleotide variant (SNV); and 
(3) Unbiased epitome identification 
method, without the use of cancer exome 
data or major histocompatibility complex 
(MHC) pathway predictors. The major- 
ity of the epitopes that fulfilled these cri- 
teria had been previously discovered by 
the expression-based screening of tumor- 
derived cDNA libraries with tumor-reac- 
tive T cells. A large majority was derived 
from melanoma patients (n = 10), but 
neo-antigens originating from NSCLC, 
small-cell lung cancer (SCLC), bladder 
cancer and renal cell cancer were also 
identified (Table 1). 

For analysis of this set of neo-antigens, 
the parental protein sequences were down- 
loaded from NCBI, after a basic local 
alignment search (BLAST) on human 
coding sequences using the non-mutated 
peptide. The parental protein sequence 
was then altered to include the mutated 
amino acid, and subsequently, these pro- 
tein sequences were used in our prediction 
pipeline. As based on our prior analyses, 
epitopes were considered as discoverable 
by exome-based analysis when meeting 
the following criteria: 16 (1) Sufficient cov- 
erage of the encoding sequence within 
cancer exome data to allow calling of the 



mutant base. For this purpose, sequence 
coverage for each neo-epitope was ana- 
lyzed for three representative data sets of 
tumor tissue and matched healthy control 
tissue, using a cutoff of 5 reads or more. 
(2) NetChop cleavage probability of 0.5 
or higher 18 ; (3) NetMHCpan predicted 
binding affinity of 500 nM or less 19 ; and 
(4). A low 'similarity-to-self ', as based on 
a test that evaluates the likelihood that 
the T-cell repertoire can distinguish the 
mutant and wild-type epitopes, either 
by altered levels of epitope presentation, 
or by an altered structure of the MHC- 
presented epitope. In this analysis, simi- 
larity to self is scored in two ways. First, 
the core region of a potential neo-epitope 
is defined as the peptide sequence between 
the two anchor residues and is considered 
to be the T-cell receptor (TCR) -exposed 
surface. 5 ' 20 If this region is considerably 
different from the core-region of the 
parental sequence (amino acid change has 
a peptide: MHC binding energy covari- 
ance (PMBEC) value of <0.05 21 ), the 
mutant peptide is considered non-similar- 
to-self and is thus retained. Alternatively, 
if the parental sequence is not predicted 
to be an MHC-ligand (Netchop cleavage 
probability lower than 0.5 or predicted 
binding affinity higher than 500 nM), the 
peptide is also considered non-similar-to- 
self and retained. 

Analysis of the 17 previously discov- 
ered neo-antigens by this in silico analysis 
pipeline reveals that the current technol- 
ogy for the initial step in exome-based 
neo-epitope prediction, the cancer exome 
sequencing itself, requires little if any 
improvement. Specifically, read coverage 
of the gene sequence in which the muta- 
tions were originally discovered was suf- 
ficient for all three tumors for which this 
was tested for 16 out of the 17 previously 
described neo-epitopes. Read numbers 
for the sequence covering the remaining 
previously identified mutation (multiple 
myeloma protein 2; MUM2 22 ) was suffi- 
cient for 2 out of 3 tumors. With respect 
to the subsequent epitope predictions, the 
use of a 500 nM predicted binding affin- 
ity as a cut-off resulted in the correct iden- 
tification of 14 out of 17 neo-epitopes, 
with two of the failures occurring for 
HLA alleles for which the prediction algo- 
rithm still requires further optimization 



(HLA-B*52:01 and HLA-C*06:02). 19 
Also, 15 out of 17 epitopes correctly passed 
the NetChop filter, and 15 out of 17 cor- 
rectly passed the self-similarity filter. The 
combination of these different thresholds 
shows that 12 out of 17 neo-antigens would 
have been identified in two out of three 
cases, and 11 out of 17 neo-antigens would 
have been correctly identified in all 3 cases 
(Table 1), resulting in an overall sensitivity 
of 69%. It is noted that there may be cases 
in which neo-antigens are derived from 
alternative open-reading frames that are 
not taken into account in epitope predic- 
tions, or from intronic sequences that are 
not spanned by exome sequencing strate- 
gies. The former is unlikely to be a major 
confounder considering that such epitopes 
would also have appeared within the unbi- 
ased data set assembled here had they been 
common. The potential relevance of the 
second subset, intron-derived neo-anti- 
gens, is presently more difficult to gauge, 
as such mutations may also not be effi- 
ciently identified in cDNA-based screen- 
ing methods if these transcript are rare. 
On a related note, identification of indels 
within sequencing data has, thus far, been 
less reliable than identification of SNVs, 
and while typically occurring infrequently 
in certain tumor types, such as melanoma, 
failure to correctly identify all indels may 
reduce the sensitivity of neo-antigen pre- 
diction in those tumors in which these 
alterations are more prevalent. 

Most importantly, overall the cur- 
rent data indicate that the sensitivity of 
neo-epitope prediction from analysis of 
exonic SNVs in cancer exome sequenc- 
ing data requires little improvement. It is 
important to emphasize the necessity to 
also describe the specificity of these epi- 
tope predictions in future work. Using 
the above-described prediction pipeline, 
endogenous T-cell responses have, so far, 
only been observed against a relatively 
small fraction of the predicted neo-epi- 
topes (van Buuren et al., unpublished). 
These experiments do not, however, reveal 
whether this low 'hit rate' is due to a large 
fraction of predicted neo-epitopes that are 
false positives, or alternatively, reflects an 
inability of the immune system to gener- 
ate an immune response against all truly 
presented neo-epitopes during natural dis- 
ease progression. Analysis of the potential 
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occurrence of such a 'neglected' neo-epit- 
ope repertoire is an important outstanding 
question remaining to be addressed. 

The data described here convincingly 
demonstrate how exome-guided analy- 
sis can be utilized to predict potential 
neo-antigen specific T-cell responses in 
patients, and, together with data from Wu 
and colleagues, suggest that this method 
will help to identify suitable candidates 
for the creation of patient-specific vac- 
cines. 23 A recent analysis by Stratton and 
colleagues shows that while melanoma has 
the highest mutational load of all com- 
mon human cancers, the range of muta- 
tion load overlaps significantly with that 
of many other common tumor types. 24 
Assuming that the mutation load within 
tumors strongly correlates with the resul- 
tant production of neo-epitopes that can 
potentially be recognized by T cells, these 
data suggest that exome-guided analysis 
of neo-antigen specific T-cell responses 
will be relevant to a variety of human 
malignancies. With clear evidence for the 
clinical activity of a number of immuno- 
therapies and with emerging evidence for 
a role of patient specific neo-antigens in 
these therapies, it is tempting to speculate 
that the combination of anti-checkpoint 
antibodies with patient-specific vaccines 
will provide further patient benefit. 
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